Advances on Genome Duplication Distances
نویسندگان
چکیده
Given a phylogenetic tree involving Whole Genome Duplication events, we contribute to the problem of computing the rearrangement distance on a branch of a tree linking a duplication node d to a speciation node or a leaf s. In the case of a genome G at s containing exactly two copies of each gene, the genome halving problem is to find a perfectly duplicated genome D at d minimizing the rearrangement distance with G. We generalize the existing exact linear-time algorithm for genome halving to the case of a genome G with missing gene copies. In the case of a known ancestral duplicated genome D, we develop a greedy approach for computing the distance between G and D that is shown time-efficient and very accurate for both the rearrangement and DCJ
منابع مشابه
Evolutionary Patterns of Recently Emerged Animal Duplogs
Duplogs, or intraspecies paralogs, constitute the important portion of eukaryote genomes and serve as a major source of functional innovation. We conducted detailed analyses of recently emerged animal duplogs. Genome data of three vertebrate species (Homo sapiens, Mus musculus, and Danio rerio), Caenorhabditis elegans, and two Drosophila species (Drosophila melanogaster and D. pseudoobscura) we...
متن کاملTurning the clock back on ancient genome duplication.
Complete genome sequence data led rapidly to the conclusion that ancient genome duplications had shaped the genomes of the model organisms Saccharomyces cerevisiae and Arabidopsis thaliana. Recent contributions have gone on to refine date estimates for these duplications and, in the case of Arabidopsis, to infer additional, more ancient, rounds of duplication by reconstructing gene order before...
متن کاملGenome Halving and Double Distance with Losses
Given a phylogenetic tree involving whole genome duplication events, we contribute to solving the problem of computing the rearrangement and double cut-and-join (DCJ) distances on a branch of the tree linking a duplication node d to a speciation node or a leaf s. In the case of a genome G at s containing exactly two copies of each gene, the genome halving problem is to find a perfectly duplicat...
متن کاملReplication fork velocities at adjacent replication origins are coordinately modified during DNA replication in human cells.
The spatial organization of replicons into clusters is believed to be of critical importance for genome duplication in higher eukaryotes, but its functional organization still remains to be fully clarified. The coordinated activation of origins is insufficient on its own to account for a timely completion of genome duplication when interorigin distances vary significantly and fork velocities ar...
متن کاملMining EST databases to resolve evolutionary events in major crop species.
Using plant EST collections, we obtained 1392 potential gene duplicates across 8 plant species: Zea mays, Oryza sativa, Sorghum bicolor, Hordeum vulgare, Solanum tuberosum, Lycopersicon esculentum, Medicago truncatula, and Glycine max. We estimated the synonymous and nonsynonymous distances between each gene pair and identified two to three mixtures of normal distributions corresponding to one ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010